Fill-In-The-Functions: Towards Establishing A Workload For Ranking in Web Databases

نویسندگان

  • Aditya Telang
  • Sharma Chakravarthy
  • Chengkai Li
چکیده

The emergence of the deep Web databases have given a new connotation to the concept of ranking query results. Earlier approaches for ranking resorted to analyzing frequencies of database values and query logs or establishing user profiles. In contrast, an integrated approach, based on the notion of a similarity model, for supporting useras well as query-dependent ranking has been recently proposed [28]. An important component of this ranking framework is a workload of ranking functions, where each function represents an individual user’s preferences towards the results of a specific query. At the time of answering a query for which no prior ranking function exists, the similarity model can ensure a good quality of ranking only if a ranking function for a very similar user-query pair exists in the workload. Thus, when there exists no function corresponding to a user asking a query, the framework must ensure that the workload contains a ranking function for at least one user-query pair similar to this pair. In this work, we address this problem of establishing an appropriate workload of ranking functions to support userand query-dependent ranking on Web databases. Toward this, we propose a novel metric, termed as Workload Goodness, that determines the appropriateness of a given set of user-query pairs in assisting the Similarity model. We then prove that the optimal solution to this problem is NPcomplete, and propose two distinct algorithms based on greedy approaches for individually determining an appropriate set of user-query pairs. We discuss the effectiveness of our proposal analytically as well as experimentally over two real Web databases.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

One Size Does Not Fit All: Towards User & Query Dependent Ranking For Web Databases

In this paper, we propose an automated solution for ranking query results of Web databases in an userand query-dependent environment. We first propose a learning method for inferring a workload of ranking functions by investigating users’ browsing choices over individual query results. Based on this workload, we propose a similarity model, based on two novel metrics – userand querysimilarity, f...

متن کامل

Towards Supporting Exploratory Search over the Arabic Web Content: The Case of ArabXplore

Due to the huge amount of data published on the Web, the Web search process has become more difficult, and it is sometimes hard to get the expected results, especially when the users are less certain about their information needs. Several efforts have been proposed to support exploratory search on the web by using query expansion, faceted search, or supplementary information extracted from exte...

متن کامل

RankingFor Web Databases Using SVM and K-Means algorithm

The Usage of internet in now a day is more and it became necessity for the people to do some applications such as searching web data bases in domains like Animation, vehicles, Movie, Real estates, etc. One of the problems in this context is ranking the results of a user query information. Earlier approaches problem have toused frequencies of database value regions, handling query logs, and user...

متن کامل

Ranking Factors Contributing to Medication Error Incidents in Private Hospital: A Nurse’s Perspective

Background and Objectives: Medication error is the most important factor threatening patient safety. While the causes of medication error are extensively researched in the public hospitals, the corresponding data from the private heath settings is limited. To help fill this shortcoming, this paper presents and discusses the results of surveying the medication error factors as perceived by nurse...

متن کامل

Ranking Iranian biomedical research centers according to H-variants (G, M, A, R) in Scopus and Web of Science

  Background: We conducted a comprehensive bibliometrics analysis to calculate the H, G, M, A and R indicators for all Iranian biomedical research centers (IBRCs) from the output of ISI Web of Science (WoS) and Scopus between 1991 and 2010. We compared the research performance of the research centers according to these indicators.   Methods : This was a cross-sectional and descriptive-analytica...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010